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Abstract 

This paper describes the use of weighted Monte Carlo events to accurately approx- 
imate integrals of functions of the experimentally measured kinematical vectors 
and their dependence on physical parameters. This technique is demonstrated in 
estimating the evolution of cross sections, efficiencies, measured kinematical distri- 
butions and mean values as functions of the Trilinear Gauge Couplings. 



1 Introduction 



The accurate estimation of the Electro Weak parameters (in the following just parameters) 
at LEPII demands attention to the definition of the probability density functions (p.d.f.). 
Measurement effects such as the detector resolution and the selection efficiency have to be 
convolved with the physics functions in order to describe the distribution of the observed 
kinematical variables. Due to the fact that detector effects are not easily parametrized, 
one usually employs Monte Carlo (M.C.) integration techniques. The disadvantage of this 
method, though, is that the integrated quantities are in principle by themselves functions 
of the parameters (e.g. the cross section or the efficiency corresponding to a part of the 
phase space) and consequently several M.C. sets of events must be used to cover the 
parametric space. 

This paper describes a reweighting technique which covers a wide range of values of 
the parameters (extrapolation area) whilst using a single M.C. set of events. Further- 
more this technique can be extended by combining several sets of M.C. events, produced 
with different parameter values, and thus enlarging the extrapolation area in addition to 
minimizing the statistical error. 

The general principles of reweighting are demonstrated in calculating the cross sections 
in multidimentional bins of the observed kinematical variables and their dependence on 
the parameter values. This particular example has important applications in the shape 
definition of the kinematical distributions which are used in measuring the Trilinear Gauge 
Couplings (TGC's). Other estimating procedures compare the average of certain functions 
of the observed kinematical vectors with the phenomenological expectations convolved 
with the resolution and efficiency functions. It is shown, in this paper, that by employing 
the reweighting technique this convolution is performed accurately and a very efficient 
estimation of the TGC's can be made. This work finally concludes with a demonstration 
of the accuracy and consistency of the technique by numerical results. 

2 Reweighting a Single Set of M.C. events 

Throughout this analysis the symbol V = V\, . . . , Vjt denotes the k-dimensional kinemati- 
cal vector which defines completely a M.C. event at generation (generation vector) whilst 
Q = Qi, . . . , Qk stands for the reconstructed equivalent of V (reconstructed or observed 
vector). The following symbols are also defined: 

a = cki, . . . , a p denoting the p parameters which are required to define the p.d.f. 

da(V, a)/dV denoting the differential cross section as a function of the generated vector. 

R(V, Q) denoting the resolution function, i.e. the probability of an event generated with 
V to be observed with Q kinematical vector. Obviously [] 



e(V) denoting the probability of an event generated with V to be observed. 

1 Where the limits of an integration are not explicitly stated an integration all over the phase space is 
meant. 
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da(Q,a)/dQ denoting the differential observed cross section as a function of the recon- 
structed vector which is defined as: 

dcr(Q, a) f da(V, a 



j^l. e{V) . ma)dV ( 2) 



dn 

a bs((y) denoting the total observed cross section. 
g(V, a) denoting the p.d.f. defined as: 

(da(V,a)/dV)-e(V) 
g{V,a) = (3) 

Following the above definitions the joint distribution of the observed kinematical vectors 
is described by: 

G(Q, a) = J g(V, a) ■ R(V, tydV (4) 

2.1 The Number of Expected Events in a Part of the Phase 
Space. 

Let Afl be a part of the observed phase space and let the integral I to be the corresponding 
cross section 

i=l d -^§^dn ( 5 ) 

J An d\ I 

for a set a of parameter values. The number of the expected events, AQ), in the AQ 
part of the phase space for an accumulated luminosity L is: 

fi(a,An)=I-L (6) 

Let us assume that there is a set of No M.C. events generated with the p.d.f. g(V,d g ) 
with d g parameter values. Let also N be the number of the above events which have 
survived the acceptance, reconstruction and selection criteria after full detector simulation 
whilst /iAn of them lie inside AQ. By multiplying and dividing the integrant of (|5]) by 
the properly normalized p.d.f. 



9(V,<*g) 
D(An,d g ) 

the integral I is written as: 



(7) 



where 



v 91 J au J dV g(V,d g ) D(Att,d g ) v 1 

D(AQ,d g )= [ [ g(V,a g ) ■ R(V,Q)dVdQ (9) 

Equation @ can be written in a simpler form, by expressing the differential cross 
section da{V,a) / dV as a product of the matrix element squared A4{V,d) and of the 
Lorentz Invariant Phase Space dLIPS(V). Specifically: 

I = D(AQ, d g ) ■ cr obs (d g )- < w(V, a, d g ) > (10) 
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M(V,d) . 

< w(V, a, d g ) >= I I w(V, a, d g ) ■ dVdSl (12) 

Then, the integral (H) can be approximated by using the set of the haq, M.C. events as: 

Obviously the function D(AQ,d g ) can be estimated as: 

D(AQ,a g )^^f (14) 



with a binomial error \Juaq • (N — haq) /N 3 , which for N 3> uaq becomes y/nAn/N. The 
global term in ( |T3j ) cr bs(a g ) (independent of the particular Af2 interval) , can be written 
as a fraction of the total cross section a to t{c^g) in the form: 

N 

cr bs{a g ) ^ Vtot{a g ) • -rr (15) 
iV 



with a fractional error of yiV ■ (i\T — N))/Nq which for N ^> N becomes yN /N . Finally 
by substituting fll4j) and (p~5| ) into ( [13]) the integral I is approximated as: 



/^"f^a.aj (16) 

The accuracy of this approximation can be quantified in terms of the variance of 
w(Vi,a,d g ) and the statistical errors of (p]) and (|T5|). Thus the error in (|T6| ) is: 

5/ = — ^- [< w(V, a, ag) > -(71ao + -^p) + 

n An • (< w 2 ^, a, a p ) > - < w(V, d, d g ) > 2 )] 1/2 (17) 

The above approximation ( JIB| ) and its error estimation flT?D are valid for every value 
of a and d g as long as the p.d.f. defined in (^) is not zero inside the AQ interval. 



2.2 The Mean Value of a Function of the Observed Kinematical 
Vectors 

It has been shown that there are functions of the true kinematical vectors, Ok(V; do) 
with k — 1, . . . ,p which locally (around do) carry the whole information concerning the 
k th parameter. This is easily proven by expanding the p.d.f. in a Taylor series around an 
initial set of values do of the parameters and keeping only the linear terms. Furthermore 
the mean values of these functions, usually called Optimal Observables, for any d around 
do, can be expressed as linear functions of the parameters with known functions as slopes 
and intercepts. The situation becomes more complicated when one has to take into 
account detector effects. However, it can be shown [0] that in this case too, there exist 
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functions of the observed kinematical vectors which retain the same information content 
as the Likelihood estimators. In general this function u(Q;do) is defined as: 

(n -\ /Vorr/ g(V,ao)-R(V,n) ^ r , 1fi * 

a ) = J 0(V; a ) • J^yR^jdV ^ ^ 

In other words, the Optimal Observable in the most general case is the mean value of 
0(V; d ) where the distribution of the vectors V 's corresponds to d parameter values 
under the condition that the observed kinematical vectors are equal to Q. Equation fll8|) 
can be further simplified if expressed in terms of 0(Q) as: 

co(Q; d ) = 0(fi; d ) ■ r(fl; d ) (19) 

where r(Q;do) is the mean value of the Optimal Observables 0(V;do) (as defined in 
(|18|)) expressed in units of 0(0; do). The function r(f2; do) plays the role of a correction 
function which has to be calculated from the M.C. In practice (at least for the case of 
the TGC's) a very good approximation is: 

u{Q; d ) ~ d ) (20) 

The mean value of 0(Q; do) (in the following Modified Observable or M.O.) 

< 0(fl; ab) >a= J 0(Q; d ) J g(V, d) ■ R(V, tt)dVdtt (21) 

in a region around do is, as before, a linear function of the couplings d but in this case 
the slopes and intercepts are convolutions of known physics functions (matrix elements 
and phase space) and the resolution and efficiency functions. Nevertheless it is simpler to 
calculate the mean value < 0(fl; do) >« as a function of d (calibration curve) by using 
M.C. events with proper weights corresponding to the at parameter values. In parallel the 
set of the N meaSi vectors, fl meas ., accumulated in the real experiment is used to measure 
the mean value of the Modified Observable as: 

<0$l;a Q )> meaa .~-—- £ O{W meas -d ) (22) 

Finally an estimation of the couplings is achieved by comparing the experimental value 
( Pffi ) with the calibration curve Q. 

The reweighting procedure needed for the calculation of the calibration curve follows 
the same general principles as the method described in the previous subsection. As before, 
by multiplying and dividing the integrant of ( pTj ) by g(V,d g ) (d g being the parameters 
used for the production of the M.C. set of events) and expressing the p.d.f.'s in terms 
of total cross sections, squared matrix elements and phase space factors, eq. fl2"T| ) is 
transformed toQ 

< 0(ft; d ) > a = a ° bs ^ . f f 0(Q; d ) ■ w{V, d, d g ) ■ g(V, d g ) ■ R{V, Q)dVdtt (23) 



2 In the following the index k is dropped to simplify the expressions. 

3 It must be emphasized that although the method of defining the calibration curve is valid for every 
a, the Modified Observable is Optimal only in a narrow region around <5o- The estimation technique 
which extends the optimality to all the points of the parametric space is discussed elsewhere Q . 

4 Assuming that g(V,a g ) > 0, W. 
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This integral can be approximated as: 



< 0(Q; d ) > c5 ~ • EM^, a, d g ) ■ 0(Q t ; d ) (24) 

where the cr 6 S (a) was expressed as 

aid) N 

a obs (d) ~ — ' 9 -J2 w ( v hd,dg) (25) 
iV i=i 

The variance of the approximation (|24|) can be estimated in the standard way in terms 
of the variance, covariance and partial derivatives of fl24|) with respect to the "random" 
variables w(Vi, d, d g ) and w(Vi, d, d g ) ■ O(0.i, d ). 



3 Combining Several Sets of M.C. events 

In practice, the use of a limited size M.C. set of events has the disadvantage that although 
the p.d.f. could be no zero, there are phase space regions where very few (or none) 
events have been generated. The use of such a set to estimate integrals or mean values, 
by reweighting, could induce systematical biases. These biases are important when the 
particular not-covered phase space region contributes significantly to the cross section 
corresponding to the extrapolated parameter values. Thus the use of several M.C. sets 
produced at different points of the parametric space and properly combined together is 
preferable. 

Let Q{oi) be the integral of a function of the observed kinematical vectors (e.g. cross 
section or mean value). Let Sj (m = 1, . . . ,m) be m sets of M.C. events produced with 
parameter values d~i, . . . , d m . Let then Qi(d) be an estimation of Q(d) with variance 
Vi{d) where the i set of M.C. events has been used as it is described in the previous 
section. Then a better estimation of Q{d) can be found as a linear combination of the 
Qi{d) 's, i.e. 

m 

QcombXa) = £7i- (26) 
i=i 

m 

E7, = l (27) 



i=l 



such as the variance of Qcomb. (a) to be minimum. It is easy to show that this is equivalent 
to the minimization of the least square sum 

m 

X 2 = E(2 Comb .(«) - Qi{d)) 2 /Vi{d) (28) 



i=l 



which results to: 



with a variance of: 



f^i Vi(a) ^Vi(a) 



m i 

V{Q comb {d)) = l./{Y.TT7-^) (30) 

i=l 
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There are however cases, where integrals of more than one functions have to be es- 
timated (e.g. in the case of two-T.G.C. simultaneous estimation, two M.O. have to be 
evaluated as functions of the two couplings) by integration using the same sets of M.C. 
events. The generalization of ( p8[ ) for n integrals reads: 

m 

X 2 = E(Q-mb.(«) - Qi(")) T • Mr 1 • (Q comb .(a) - (31) 

i=l 

where the vectors Qcomb.(®) and Qi(a) have elements coresponding to each of the n 
integrated functions, whilst the subscript i stands for the set of the M.C. events used in 
the integration. The matrices Mj represent the covariant matrices between the elements 
of the vector Qj(a). 

4 Numerical Results 

For any practical application of the reweighting technique, one needs only the means to 
calculate the weight w(V,a,d g ) as it is defined in (|TT|). Concentrating on the physics 
analysis concerning the determination of the TGC's there are several four fermion M.C. 
generators which include the relevant physics processes ||. Among them, the ERATO 
generator |4[] has been chosen and was modified keeping only the parts necessary for the 
calculation of the squared matrix element (M.E.). In case that an ISR photon (with or 
without Pt) had been emitted, the four fermions kinematical vectors were transformed to 
their rest frame before calling the ERATO M.E. calculating routines. Here the ISR effect 
is assumed to be factorizable and thus independent of the variables. Thus, as it has been 
shown previously with the phase space factors, the ISR factors in the weight definitions 
are canceled out. 

In general lines one should a) define the values of the physics constants to be used for 
the matrix element calculation, b) pass the kinematical and event type information and 
c) call the relevant software twice to calculate the ratio of the M.E.'s as in (|TT|). 

In the following, the reweighting method is demonstrated through applications con- 
cerning the evaluation of physical quanitites needed in the estimation of the TGC's, using 
four fermion (specifically the lepton-neutrino-two hadronic jets) final state events as those 
produced at LEPII at 172 GeV centre of mass energy. 

The available M.C. events undergone full detector simulation (DELSIM ||) are 
grouped into two main categories. 

• Events produced with PYTHIA || with the TGC's set to their Standard Model 
(S.M.) values whilst six different W mass (79.35,79.85,80.23,80.35,80.85 and 81.35 
GeV/c 2 ) values were used. These six subcategories contain in all 3280 and 3390 
events having an electron (electron type) or a muon (muon type) in the final state 
respectively. 

• Events produced with EXCALIBUR with the W mass set to 80.35 GeV/c 2 but 
divided into three groups according to the aw<j> values used in the production whilst 
the other couplings are related according to the W<p model |§ . Namely the aw<f> = 
subgroup contains 790 electron type and 900 muon type events, the aw<j> = —2 
subgroup contains 770 electron type and 900 muon type events and the aw<t> = 2 
subgroup contains 810 electron type and 900 muon type events. 
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Identical selection criteria have beee applied on both generation and reconstruction 
level to all the M.C. set of events. The PYTHIA physics generator employs only the 
resonant Feynman graphs (CC03) whilst the EXCALIBUR generator includes all the four 
fermion (CC10 for muon type and CC20 for electron type) relevant processes. For all the 
applications to follow the extrapolated parameter variables correspond to the full four 
fermion production mechanisms and the extrapolated W mass value was equal to 80.35 
GeV/c 2 whilst the estimated quantities are considered as functions of the awcp coupling. 

The simplest demonstration of the reweighting technique deals with the estimation of 
the total cross section as a function of the coupling. This in accordance to ( |16|) can be 
estimated by using the j th set of the available M.C. events as: 

l i ~ ' £ w ( Vi > aw ^ ( 32 ) 

^ v o i=l 

with a statistical error equal to: 

S j = tot< ^Jl^ ■ [< w 2 (V, a W4n a 3 Wlj> ) > - < w(V, a W4n a J W(f) ) > 2 } 1/2 (33) 
V iV o 



In the following several M.C. set of events are combined as in fl29|) to increase the 
statistical accuracy of this approximation. Figure 0a and |]b presents the estimated de- 
pendence of the number of produced events for a 9.6 pb~ l accumulated luminosity on the 
value of otW(f> for electronic and muonic final states when only the PYTHIA M.C. sets 
have been reweighted. This estimation of the reweighting procedure agrees, for coupling 
values between -2 and 2, within statistical errors with the prediction (solid line) of the 
ERATO generator containing the full set of four fermion production diagrams. However, 
this extrapolation suffers from the fact that the PYTHIA generated events do not cover 
adequately all the phase space regions which are contributing significantly to the four 
fermion cross section for larger absolute values of the couplings. An enlargement of the 
extrapolation region with a simultaneous improvement of the accuracy is achieved when 
the EXCALIBUR M.C. sets are also included as it can be seen in figures |l]c and ||d. 

The second example is dealing with integrations with respect to the observed kinemat- 
ical vectors. In this case the efficiency is estimated as the fraction of the number of the 
produced events (eq. |32[), which are selected in the final analysis sample (eq. |IB|). As it is 
shown in fig. ^| the estimated efficiency by reweighting is in a very good agreement with 
the results of direct calculations using the EXCALIBUR M.C. sets produced at different 
values of otw<t>- 

The success of the reweighting technique to perform integrations with respect to the 
observed kinematical vectors can be also demonstrated by comparing estimations of the 
number of events to be observed in a special phase space region, by reweighting and 
by direct M.C. integration. In this example only the PYTHIA M.C. sets were used to 
estimate these differential quantities as functions of aw4> by employing reweighting. In 
parallel the EXCALIBUR M.C. sets were used for a direct estimation at the three available 
values of aw<t>- These integrals represent the number of the expected events for a total 
luminosity of 9.6 in bins either of cosOw (Fig [| to Fig H) {6w being the polar angle 
of the hadronic system) or of the Modified Observable (Fig |^ to Fig |8|). As a measure of 
the level of agreement in this comparison, the relative difference of the two estimations 
was chosen (relative deviation) but the estimated numbers of events are also presented 
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as insets in the previous figures. The remarkable agreement between the two type of 
estimations justifies the use of the reweighting technique in producing the distribution 
shapes for a wide area of the otw<t> to be used in fitting the TGC's. 

As extensively discussed previously, it is possible to define a Modified Observable such 
as its mean value is an estimator of the aw<p which carries the same information as the 
likelihood. It has been also shown that (independent of the optimality) the evolution 
of this quantity as a function of the coupling can be accurately approximated by the 
reweighting procedure. Indeed, as it is shown in Fig. 0, the mean value of the Modified 
Observables defined for cew<p = and estimated by reweighting is in a very good agreement 
with the direct estimations by using the EXCALIBUR M.C. sets. However the optimality 
of this method is restricted to the linear part around aw<f> = 0. The extention of this 
optimality is achieved by expanding the p.d.f. in a Taylor series around another value 
of the coupling and reformulating the Modified Observables. This procedure for optimal 
definition has been employed around the two extreme values of aw</> = ±2. The results 
are shown in Fig [10] where also for comparison the directly estimated values with the 
EXCALIBUR M.C. sets are presented. 

5 Conclusions 

Following very simple calculus, it has been shown that a M.C. set of events generated at 
specific values of the relevant physical parameters can be used to approximate physical 
quantities in a very wide region of the parametric space. Furthermore, a simple proce- 
dure has been demonstrated which combines several M.C. samples, independent of the 
specific values of the physical parameters at their generation. The quantification of the 
accuracy of this approximation has been expressed analytically and demonstrated by nu- 
merical examples. The applicability of this technique to produce the shapes of kinematical 
quantities to be used in extracting the values of physical parameters was proven. 
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Figure 1: The total Cross Section as a function of the aw^' a) and b) only the PYTHIA, 
c) and d) the PYTHIA and the EXCALIBUR Monte Carlo sets are included in the 
reweighting. 
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Figure 2: The Efficiency as a function of the aw 4, estimated by reweighting only the 
PYTHIA Monte Carlo sets. The squares represent the efficiency values estimated directly 
from the EXCALIBUR samples. 
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Figure 3: Comparison of the differential cross section with respect to the cos6 w as it is 
estimated by reweighting the PYTHIA sets with the distribution of unweighted Monte 
Carlo events produced with EXCALIBUR at aw<j> — 0. The solid lines and the black points 
in the inset figures correspond to the reweighting estimation whilst the open circles stand 
for the EXCALIBUR prediction. 
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Figure 4: Comparison of the differential cross section with respect to the cos 9 W as it is 
estimated by reweighting the PYTHIA sets with the distribution of unweighted Monte 
Carlo events produced with EXCALIBUR at a.w<j> = —2. The solid lines and the black 
points in the inset figures correspond to the reweighting estimation whilst the open circles 
stand for the EXCALIBUR prediction. 
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Figure 5: Comparison of the differential cross section with respect to the cos6 w as it is 
estimated by reweighting the PYTHIA sets with the distribution of unweighted Monte 
Carlo events produced with EXCALIBUR at aw<j> = +2. The solid lines and the black 
points in the inset figures correspond to the reweighting estimation whilst the open circles 
stand for the EXCALIBUR prediction. 
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Figure 6: Comparison of the differential cross section with respect to the Modified Ob- 
servables (defined at aw<f> = 0) as it is estimated by reweighting the PYTHIA sets with the 
distribution of unweighted Monte Carlo events produced with EXCALIBUR at aw^> = 0. 
The solid lines and the black points in the inset figures correspond to the reweighting 
estimation whilst the open circles stand for the EXCALIBUR prediction. 
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Figure 7: Comparison of the differential cross section with respect to the Modified Ob- 
servables (defined at aw<f> = 0) as it is estimated by reweighting the PYTHIA sets with the 
distribution of unweighted Monte Carlo events produced with EXCALIBUR at aw<i> = ~ 2. 
The solid lines and the black points in the inset figures correspond to the reweighting es- 
timation whilst the open circles stand for the EXCALIBUR prediction. 
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Figure 8: Comparison of the differential cross section with respect to the Modified Ob- 
servables (defined at aw<f> = 0) as it is estimated by reweighting the PYTHIA sets with the 
distribution of unweighted Monte Carlo events produced with EXCALIBUR at aw(/> = +2. 
The solid lines and the black points in the inset figures correspond to the reweighting es- 
timation whilst the open circles stand for the EXCALIBUR prediction. 
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Figure 9: The Mean Value of the Modified Observable defined at a W( p = 0, as a function 
of the a W( p. The squares correspond to unweighted EXCALIBUR events. 
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Figure 10: The Mean Value of the Modified Observable defined at (a and b) «^ equal 
to -2 and (c and d) at 2 as a function of the aw<j>- The squares correspond to unweighted 
EXCALIBUR events. 



